SEQUENCE LISTING 



(!) GENERAL INFORMATION: 



(i) 



APPLICANT: MAERTENS , GEERT 
BOSMAN, PONS 



DE MARTYNOFF, GUY 
BUYSE, MARIE-ANGE 



(ii) TITLE OF INVENTION: PURIFIED HEPATITIS C VIRUS ENVELOPE 
PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

(iii) NUMBER OF SEQUENCES: 111 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

<B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 
{£) COUNTRY : U.S.A. 
(F) ZIP: 22201-4714 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1,25 (EPO) 



H (viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: BYRNE, THOMAS E. 

(B) REGISTRATION NUMBER: 32,205 

(C) REFERENCE /DOCKET NUMBER: 1487-10 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 




(vi) 



CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/612,973 

(B) FILING DATE: ll-MAR-1996 

(C) CLASSIFICATION: 



HCVENV.-T 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAAG CTTAATTAAT T 21 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii} HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

C| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

JgpGGGGAGGC CTGCACGTGA TCGAGGGCAG ACACCATCAC CACCATCACT AATAGTTAAT 60 

ftkACTGCA 63 

.(2) INFORMATION FOR SEQ ID NO: 3: 

C (i) SEQUENCE CHARACTERISTICS: 
r": (A) LENGTH: 642 base pairs 

5? (B) TYPE: nucleic acid 

P (C) STRANDEDNESS: single 

Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B5 LOCATION: 1..639 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..636 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG TCC GGG ATG 96 
Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG TAT GAG GCA 144 
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Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 240 
Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 
Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
O 100 105 no 

%C GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 384 
W/s Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie Ser Pro Arg 
N- 115 120 125 

CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC 432 
Ikg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
?P 130 135 140 

'ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 480 
Wle Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
©5 150 155 160 

&GA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 528 
&r Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 

j*f 165 170 175 

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 576 
Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
180 185 190 

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 624 
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
195 200 205 

CTC TTT GCT CTC TAATAG 642 
Leu Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 
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Leu Thr lie Pro Ala Ser Ala 
20 



Tyr Glu Val Arg Asn Val Ser Gly Met 
25 30 



Tyr His Val Thr Asn Asp Cys 
35 



Ser Asn Ser Ser lie Val Tyr Glu Ala 
40 45 



Ala Asp Met lie Met His Thr 
50 55 



Pro Gly Cys Val Pro Cys Val Arg Glu 
60 



Asn Asn Ser Ser Arg Cys Trp 
65 70 



Val Ala Leu Thr Pro Thr Leu Ala Ala 
75 80 



Arg Asn Ala Ser Val Pro Thr 
85 



Thr Thr lie Arg Arg His Val Asp Leu 
90 95 



Leu Val Gly Ala Ala Ala Leu 
100 



Cys Ser Ala Met Tyr Val Gly Asp Leu 
105 110 



Cf^s Gly Ser Val Phe Leu Val 

tfl 115 



Ser Gin Leu Phe Thr lie Ser Pro Arg 
120 125 



Arg His Glu Thr Val Gin Asp 

135 



m 130 



Thr Gly His Arg Met Ala 
lift 150 



Cys Asn Cys Ser lie Tyr Pro Gly His 
140 

Trp Asp Met Met Met Asn Trp Ser Pro 
155 160 



"fife Thr Ala Leu Val Val Ser 

rt 165 



Gin Leu Leu Arg lie Pro Gin Ala Val 
170 175 



Val Asp Met Val Ala Gly Ala 

y i3o 



His Trp Gly Val Leu Ala Gly Leu Ala 
185 190 



Tfyr Tyr Ser Met Val Gly Asn 
195 



Trp Ala Lys Val Leu lie Val Met Leu 
200 205 



Leu Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..792 



(ix) FEATURE: 



(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..789 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

AgA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Sir Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
# 50 55 60 

f|G TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
$|u Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

70 75 80 



tic GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 

b# 85 90 95 

f$T GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

i*fr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

U 100 105 110 

PT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

» 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 
Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 
Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 
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CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 768 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu lie 
245 250 255 

GTG ATG CTA CTC TTT GCT CCC TAATAG 795 
Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
Q {A} LENGTH: 263 amino acids 

yfi (B) TYPE: amino acid 

3 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Mft Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

;ri s io is 

vjjl Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

. W 20 25 30 

Alk Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
H 35 40 45 

SB5KS. 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Aso Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 * 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 
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Pro Gly His He Thr Gly His Arg 
195 200 

Trp Ser Pro Thr Thr Ala Leu Val 
210 215 

Gin Ala Val Val Asp Met Val Ala 
225 230 

Gly Leu Ala Tyr Tyr Ser Met Val 
245 

Val Met Leu Leu Phe Ala Pro 
260 



# 

Met Ala Trp Asp Met Met Met Asn 
205 

Val Ser Gin Leu Leu Arg He Pro 
220 

Gly Ala His Trp Gly Val Leu Ala 
235 240 

Gly Asn Trp Ala Lys Val Leu He 
250 255 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 633 base pairs 

^ (B) TYPE: nucleic acid 

M (C) STRANDEDNESS : single 

U| (D) TOPOLOGY: linear 

|S (ii) MOLECULE TYPE: cDNA 

m (iii) HYPOTHETICAL: NO 

jh& (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..630 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..627 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96 
Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 
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TCC GGG ATG TAG CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACT ACG ACA ATA CGA CGC CAC 432 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

gTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 

S&l Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
p5 150 155 160 

l§£G GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 

C%y Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
;I 165 170 175 

iCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

Her Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser lie Tyr 
a 180 185 190 

SpC GGC CAC ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

Kro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 

195 200 205 

fg3G TAATAG 633 

210 



(2) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 



Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

0% Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 

tfl 165 170 175 

Selc Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 

bi 180 185 190 

6r§3 Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
1^ 195 200 205 

Hi 

■Sep 



@ INFORMATION FOR SEQ ID NO: 9: 

El (i) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
. (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

{ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..480 

(ix) FEATURE: 

^ (A) NAME /KEY: mat_peptide 
(B) LOCATION: 1..477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 43 
Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96 
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# 



Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

TAG CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

ifC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 

Jjeu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 

C= 100 105 110 

Jffec GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

f%s Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
III 115 120 125 

JZGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 

3Vrg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
51 130 135 140 

fefTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483 

Ski Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser 
m5 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
1 5 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 
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# 



Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser 
145 150 155 



Jg) INFORMATION FOR SEQ ID NO: 11: 

# (i) SEQUENCE CHARACTERISTICS: 

y|J (A) LENGTH: 480 base pairs 

\1 (B) TYPE: nucleic acid 

iJi (C) STRANDEDNESS : single 

5; (D) TOPOLOGY: linear 

W (ii) MOLECULE TYPE: cDNA 

y i 

s (iii) HYPOTHETICAL: NO 

C (iii) ANTI-SENSE: NO 



M (ix) FEATURE: 

H (A) NAME /KEY: CDS 

M* (B) LOCATION: 1..477 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..474 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG TCCGGT 


TGC 


TCT 


TTC 


TCT 


ATC 


TTC 


CTC 


TTG 


GCC CTG CTG TCC TGT 


48 


Met Ser Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala Leu Leu Ser Cys 




1 


5 










10 




15 




CTG ACC ATA 


CCA 


GCT 


TCC 


GCT 


TAT 


GAA 


GTG 


CGC 


AAC GTG TCC GGG GTG 


96 


Leu Thr He 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn Val Ser Gly Val 






20 










25 






30 




TAC CAT GTC 


ACG 


AAC 


GAC 


TGC 


TCC 


AAC 


TCA 


AGC 


ATA GTG TAT GAG GCA 


144 


Tyr His Val 


Thr Asn Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He Val Tyr Glu Ala 




35 










40 








45 




GCG GAC ATG 


ATC 


ATG 


CAC 


ACC 


ccc 


GGG 


TGC 


GTG 


CCC TGC GTT CGG GAG 


192 


Ala Asp Met 


He 


Met 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro Cys Val Arg Glu 




50 








55 










60 




GGC AAC TCC 


TCC 


CGT 


TGC 


TGG 


GTG 


GCG 


CTC 


ACT 


CCC ACG CTC GCG GCC 


240 


Gly Asn Ser 


Ser Arg Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro Thr Leu Ala Ala 




65 






70 










75 


80 
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AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAG GTG GGG GAT GTC 336 
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 
Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 
Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His 
130 135 140 

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 430 
fill Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

0 s 150 155 

J%) INFORMATION FOR SEQ ID NO: 12: 

yj 

Q (i) SEQUENCE CHARACTERISTICS: 

f|| {A) LENGTH: 158 amino acids 

m (3) TYPE: amino acid 

(D) TOPOLOGY: linear 

f & (ii) MOLECULE TYPE: protein 

U (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

jpfet Ser Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 
PI (A) NAME/ KEY: CDS 

5 (B) LOCATION: 1.-633 

r\ (ix) FEATURE: 

(A) NAME /KEY : mat_peptide 
W (B) LOCATION: 1..630 

IJI (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ift : G CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 
Ml% Leu Gly Lys Ala lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
y| 5 10 15 

Q|p GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
vSi Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 432 
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Leu Ala Ala Arg Asn Ala Ser lie Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 
Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser lie Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

#GG TAC TAATAG 640 

yrjrp Tyr 
vj 210 



; ^2) INFORMATION FOR SEQ ID NO; 14: 

; (i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 210 amino acids 

II (B) TYPE: amino acid 

*jf (D) TOPOLOGY: linear 

51 (ii) MOLECULE TYPE: protein 

1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Gly Lys Ala He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
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130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Vai 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Tyr 
210 



hg2) INFORMATION FOR SEQ ID NO: 15: 

=|| (i) SEQUENCE CHARACTERISTICS: 
\| (A) LENGTH: 26 base pairs 

s s i (B) TYPE: nucleic acid 

S? (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

111 (ii) MOLECULE TYPE: cDNA 

h& (iii) HYPOTHETICAL: NO 

hf (iii) ANTI-SENSE: NO 



U (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGCCCGGTT GCTCTTTCTC TATCTT 26 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CTATTAGGAC CAGTTCATCA TCATATCCCA 
(2) INFORMATION FOR SEQ ID NO: 18: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

HYPOTHETICAL: NO 

ANTI-SENSE: YES 



SEQUENCE DESCRIPTION: SEQ ID NO 
Cf ATTACCAG TTCATCATCA TATCCCA 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 







if! 


(i) 


%i 






(ii) 




(iii) 




(iii) 







(xi) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 



• # 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 

(2) INFORMATION FOR SEQ ID NO: 21: 

^ eij SEQUENCE CHARACTERISTICS : 
^ (A) LENGTH: 723 base pairs 

® (B) TYPE: nucleic acid 

H (C) STRANDEDNESS: single 

bf (D) TOPOLOGY: linear 

K (ii) MOLECULE TYPE: cDNA 

W {iii} HYPOTHETICAL: NO 

M> (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..720 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Giy Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 
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TCC 


GGG 


ATG 


TAC 


CAr 


bit- 


ACb 


AAU 


tiAC 






AAw 


TCA AGC 


7\ mm 

ATT 


GTG 


288 


Ser 


Gly 


Met 


Tyr 


Hxs 


val 


Tnr 


Asn 


ASp 


Cys 


Ser 


Asn 


Ser 


Ser 


He 


Val 










85 










90 










95 






TAT 


vaAu 


bCA 




UAU 




Al ^ 


ATT* 


t-AU 


ACC 


CCC 


cm 


TGC 


GTG 






IT/* 


Tyr 


(alU 


ax a 


ax a 


Asp 


Mei- 


T1 Q 

xxe 


weTi 


ni5 


Thr 


Pro 


biy 


Cys 


Val 


Pro 


Cys 










100 










105 










110 








GTT 


CGG 


GAG 


AAC 


AAC 


TCT 


TCC 


CGC 


TGC 


TGG 


GTA 


GCG 


CTC 


ACC 


CCC 


ACG 


384 


vax 


Arg 


vjXU 


Asn 


Asn 


Car 

oer 


oer 


Arg 


Cys 


Trp Val 


nia 


Leu 


Thr 


Pro 


Thr 






1 1 c 

11D 










ion 










125 










CTC 


GCA 


GCT 


AGG 


AAC 


GCC 


AGC 


GTC 


ccc 


ACC 


ACG 


ACA 


ATA 


CGA 


CGC 


CAC 


432 


Leu 


AXd 


Axa 


Arg 


Asn 


ax a 


Ser 


Val 


Pro 


Thr 


Thr 


1 UI 


lie Arg Arg His 






1 

iJU 




















A.** v 












GTC 


GAT 


TCC 


CAG 


CTG 


TTC 


ACC 


ATC 


TCG 


CCT 


CGC 


CGG 


CAT 


GAG 


ACG 


GTG 


480 


J» »X 


Asp 






Leu 


t lit; 


llii 


Tin 

lie 




Pro Arg 


& r*rr 
ni y 


His 


Glu 


Thr 


Val 




-•3 4 D 


















155 










160 






GAC 


TGC 


AAT 


TGC 


TCA 


ATC 


TAT 


ccc 


GGC 


CAC 


ATA 


ACG 


GGT 


CAC 


CGT 


528 


*Gin 


Asp 


Cys 


Asn 


Cys 


oer 


lie 


Tyr 


Pro 


Gly His 


xxe 


Thr Gly His Arg 




Ill 








loo 










170 










175 








GCT 


TGG 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


ACG 


GCC 


CTG 


GTG 


576 




Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val 










180 










185 










190 








^GTA 


TCG 


CAG 


CTG 


CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


GTG 


GAC 


ATG 


GTG 


GCG 


624 


J=^al 


Ser 


Gin 


Leu 


Leu 


Arg 


lie 


Pro 


Gin 


Ala 


Val 


vax 


Asp Met 


Val 


Ala 








iyo 




















205 










CSgg 


GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGT 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG 


GTG 


672 




Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr Ser Met 


Val 






210 










215 










220 












GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


ATT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCT 


CCC 


TAATAG 


723 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Pro 






225 










230 










235 










240 





(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 
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Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
6f5 150 155 160 

, &t 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His lie Thr Gly His Arg 
H 165 170 175 

Mfet Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 

JfJ 180 185 190 

¥al Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala 
s 195 200 205 

gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val 
y[ 210 215 220 

My Asn Trp Ala Lys Val Leu He Val Met Leu Leu Phe Ala Pro 
M5 230 235 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .558 

(ix) FEATURE: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 1. .555 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

,;€fG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
'I&u Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
"%5 70 75 80 

*lfcC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
tgbr Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
Si 85 90 95 

{|§VT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

; 4Vr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

^ 100 105 110 

gTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

§al Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
JS 115 120 125 

-ESSE, 

W£C GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
feu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561 
Met Ala Trp Asp Met Met Met Asn Trp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
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1 5 10 15 

Val Gly Tyr lie Pro Leu Val GXy Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Ty<r Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Vlll Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
M 115 120 125 

Igu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

til Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

<5Sn Asp Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg 
2 165 170 175 

Ala Trp Asp Met Met Met Asn Trp 
180 185 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..603 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..600 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
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Alb 


ttp 
1 lb 


bbi 




ctp 


atp 


CAT 


APP 


PTT 
vU 


ACA TGC GGC TTC GCC 


GAC 


CTC 


*i O 


Met 


Leu 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr Cys Gly Phe Ala Asp Leu 




1 




5 










10 


15 






GTG 


bbb 


lAb 


Al 1 


PPC 

bbb 


PTP 


CTP 


CCP 


CPP 


CCC CTA GGG GGC GCT 


GCC 


AGG 


-7 w 


Val 


Gly 


Tyr 


He 


Pro 


Leu 


Val 


Gly 


Ala 


Pro Leu Gly Gly Ala Ala Arg 








20 










25 


30 








GCC 


CTG 


GCG 


CAT 


GGC 


GTC 


bbb 


Gl 1 


bib 


GAG GAC GGC GTG AAC 


TAT 


GCA 


1 A A 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu Asp Gly Val Asn 


Tyr Ala 








35 










40 




45 








ACA 


GGG 


AAT 


TTG 


CCC 


GG1 


ibb 


TPT 
lbl 


TTP 

TTb 


TCT ATC TTC CTC TTG 


GCT 


TTG 




Thr 


Gly 


Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser He Phe Leu Leu 


Ala 


Leu 






50 










55 






60 








blb 


TPP 


TCT 
lbl 


ptc 




CTT 


CPA 


CPT 


TPP 


GCT TAT GAA GTG CGC 


AAC 


GTG 


240 


&SU 


Ser 


Cys 


Leu 


Thr 


Val 


Pro 


Ala 


Ser 


Ala Tyr Glu Val Arg Asn Val 




15 








70 








75 




80 




£^?C 


ppp 
bbb 


Alb 


tap 

1 Ab 


pa? 


CTP 


A PC 


A AP 


CAP 


TGC TCC AAC TCA AGC 


ATT 


GTG 


288 




Gly 


Met 


Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys Ser Asn Ser Ser 


lie 


Val 




UJ 




85 










90 


95 






JfFjT 


GAG 


GCA 


bbb 


bAb 


Alb 


ATP 
Al b 


ATC 
Alb 


PAP 
bAb 


ACC CCC GGG TGC GTG 


CCC 


TGC 




tfir 


Glu 


Ala 


Ala 


Asp 


Met 


He 


Met 


His 


Thr Pro Gly Cys Val 


Pro 


Cys 




m 






100 








105 


110 








(§^T 


CGG 


GAG 


AAC 


AAb 


TPT 
lbl 


TPP 

ibb 


ppp 

bbb 


TPP 
Ibb 


TGG GTA GCG CTC ACC 


CCC 


ACG 


Jul 


3 


Arg 


Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp Val Ala Leu Thr 


Pro 


Thr 






115 










120 




125 










GCA 


GCT 


AGG 


AAC 


GCC 


AGC 


GTC 


CCC 


ACC ACG ACA ATA CGA 


CGC 


CAC 


432 


fiu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr Thr Thr He Arg Arg 


His 




p& 


130 








135 






140 








GTC 


GAT 


TCC 


CAG 


CTG 


TTC 


ACC 


ATC 


TCG 


CCT CGC CGG CAT GAG 


ACG 


GTG 


480 


Val 


Asp 


Ser 


Gin 


Leu 


Phe 


Thr 


He 


Ser 


Pro Arg Arg His Glu 


Thr 


Val 




145 








150 








155 




160 




CAb 


p^p 

bAb 


TPP 
Ibb 


n at 

AAl 


TCP 
Ibb 


TPS 


ATP 


TAT 


ppp 


GGC CAC ATA ACG GGT 


CAC 


CGT 


528 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly His He Thr Gly His Arg 












165 










170 


175 






ATG 


GCT 


TGG 


GAT 


ATG 


ATG 


ATG 


AAC 


TGG 


TCG CCT ACA ACG GCC 


CTG 


GTG 


576 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser Pro Thr Thr Ala 


Leu 


Val 








180 










185 


190 








GTA 


TCG 


CAG 


CTG 


CTC 


CGG 


ATC 


CTC 


TAATAG 






606 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Leu 
















195 










200 













{2} INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

sSer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
% 85 90 95 

!%r Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

'H 100 105 110 

Sfel Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

ffi 115 120 125 

"£feu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
f 130 135 140 

jgj&l- Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
§£5 150 155 160 

-rfsss. 

Sin Asd Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg 
^ * 165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Leu 
195 200 



{2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..633 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 43 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

W 35 40 45 

iCA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

*©ir Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

y 50 55 60 

igfrG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 

; 'j|eu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

Hi65 70 75 80 

p&CC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
?Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
E 85 90 95 

tfkT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Hyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
W 100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
' 130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asd Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 " 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 57 6 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 624 
Val Ser Gin Leu Leu Arg lie Val lie Glu Gly Arg His His His His 
195 200 205 

CAT CAC TAATAG 636 
His His 
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210 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

til Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
■m 20 25 30 

©a Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
S T 35 40 45 

Ml? 

fir Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
m 50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

l|5 70 75 80 

fir Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
H 85 90 95 

'§r Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
2 100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 

His His 
210 



(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 



<ix) 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..627 

FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



is. s 

WrG GGT AAG GTC ATC GAT ACC CTT 
tjjfet Gly Lys Val lie Asp Thr Leu 

flj 1 5 

JfGG TAC ATC CCG CTC GTC GGC GCT 
?t31y Tyr lie Pro Leu Val Gly Ala 

E 20 

pTT GCG CAT GGC GTG AGG GCC CTT 
fleu Ala His Gly Val Arg Ala Leu 

pi 35 40 

^GG AAT TTG CCC GGT TGC TCC TTT 
Gly Asn Leu Pro Gly Cys Ser Phe 
50 55 

TCT TGC TTA ATT CAT CCA GCA GCT 
Ser Cys Leu lie His Pro Ala Ala 
65 70 

GGC CTC TAT GTC CTT ACC AAC GAC 
Gly Leu Tyr Val Leu Thr Asn Asp 
85 



ACG TGC GGA TTC GCC GAT CTC ATG 
Thr Cys Gly Phe Ala Aso Leu Met 
10 15 

CCC GTA GGA GGC GTC GCA AGA GCC 
Pro Val Gly Gly Val Ala Arg Ala 
25 30 

GAA GAC GGG ATA AAT TTC GCA ACA 
Glu Asp Gly lie Asn Phe Ala Thr 
45 

TCT ATT TTC CTT CTC GCT CTG TTC 
Ser lie Phe Leu Leu Ala Leu Phe 
60 

AGT CTA GAG TGG CGG AAT ACG TCT 
Ser Leu Glu Trp Arg Asn Thr Ser 
75 80 

TGT TCC AAT AGC AGT ATT GTG TAC 
Cys Ser Asn Ser Ser lie Val Tyr 
90 95 



GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 
Glu Ala Asp Asp Val lie Leu His Thr Pro Gly Cys lie Pro Cys Val 
100 105 110 

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 
Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 
Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 



# 



GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 

165 170 175 

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 576 

Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 

180 185 190 

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

195 200 205 

TAATAG 634 



J|2) INFORMATION FOR SEQ ID NO: 30: 

* (i) SEQUENCE CHARACTERISTICS: 

' % i (A) LENGTH: 208 amino acids 

hi (B) TYPE: amino acid 

Q (D) TOPOLOGY: linear 



;* ii?3 



{ii} MOLECULE TYPE: protein 
f (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

fj|et Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
U I 5 10 15 

Sly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
M 20 25 30 

Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 55 60 

Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 
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• # 



>ro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

195 200 205 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

JS (ii) MOLECULE TYPE: CDNA 
?j (iii) HYPOTHETICAL : NO 
W (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..627 



w (ix) FEATURE: 

N* (A) NAME /KEY: mat_peptide 

p (B) LOCATION: 1..624 

o 

■"' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48 
Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

GGG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96 
Gly Tyr He Pro Leu Val Gly Gly Pro He Gly Gly Val Ala Arg Ala 
20 25 30 

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144 
Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240 
Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288 
31y He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336 
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Glu Ala Asp Asn Leu lie Leu His Ala Pro Gly Cys Val Pro Cys Val 
100 105 110 

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384 
Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin lie Thr Pro Thr Leu 
115 120 125 

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432 
Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

GAC'TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480 
Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528 
Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
Q 165 170 175 



JgCT CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 576 
Sro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser lie Tyr Ser 
180 185 190 

j|bc CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624 
Wky His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

Iff 195 200 205 

ijAATAG 630 



fJ2) INFORMATION FOR SEQ ID NO: 32: 

^ (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr He Pro Leu Val Gly Gly Pro He Gly Gly Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 



Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 



Glu Ala Asp Asn Leu lie Leu His Ala Pro Giy Cys Val Pro Cys Val 
100 105 110 

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin lie Thr Pro Thr Leu 
115 120 125 

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser lie Tyr Ser 
180 185 190 

ply His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

.1 195 200 205 



|||) INFORMATION FOR SEQ ID NO: 33: 

f (i) SEQUENCE CHARACTERISTICS: 
M* (A) LENGTH: 23 base pairs 

0 (B) TYPE: nucleic acid 

M: <C) STRANDEDNESS : single 

p (D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



* {xi} SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TGGGATATGA TGATGAACTG GTC 23 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



• 



CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30 

(2} INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



::,f! (ix) FEATURE: 

X (A) NAME/KEY: CDS 

Cl (3) LOCATION: 1..1473 

^ (ix) FEATURE: 

P (A) NAME /KEY: matjpeptide 

fU (B) LOCATION: 1..1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ff§G GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG 48 

tip Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 

Ell 5 10 15 

<p|G CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC 96 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 

20 25 30 



CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG GTG GGG 


AAC 


144 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met Val Gly 


Asn 








35 










40 










45 






TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCC 


GGC GTC GAC 


GGG 


192 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Gly Val Asp 


Gly 




50 










55 










60 








CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG 


GCA 


GCA 


GCC 


TCC 


GAT 


ACC AGG GGC 


CTT 


240 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly 


Ala 


Ala 


Ala 


Ser 


Asp 


Thr Arg Gly 


Leu 




65 










70 










75 






80 




GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG CTC GTA 


AAC 


288 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin Leu Val 


Asn 












85 








90 






95 






ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC TGC AAC 


GAC 


-336 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn Cys Asn 


Asp 








100 










105 








110 






TCC 


CTC 


CAA 


ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA CAC AAA 


TTC 


384 


Ser 


Leu 


Gin 


Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lys His Lys 


Phe 








115 










120 










125 






AAC 


TCG 


TCT 


GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC TCC ATC 


GAC 


432 
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Asn 


Ser 


Ser 


Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 






130 










135 










140 












AAG 


TTC 


GCT 


CAG 


GGG 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


480 


Lys 


Phe 


Ala 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 




145 










150 










155 










160 




TCG 


GAC 


CAG 


AGG 


CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


528 


Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 












165 










170 










175 






ATT 


GTA 


CCC 


GCG 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


576 


lie 


Val 


Pro 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 










180 










185 










190 








AGG 


CCT 


GTT 


GTG 


GTG 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


624 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 




PI 




195 










200 










205 










)kkc 


TGG 


GGG 


GCG 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


672 




Trp 


Gly 


Ala 


Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 






210 










215 










220 












3FC(Z 


cccz 


CGA 




AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


720 




Pro 


Arg 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Gly 


Thr 


Gly 




||5 










230 










235 










240 




^TC 


AGG 


AAG 


ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


768 


She 


Thr 


Lys 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 




CI 








245 










250 










255 






^AAC 


AAC 


ACC 


TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


816 




Asn 


Thr 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 








260 










265 










270 










ACC 


TAC 


GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


864 


Ala 


Thr 


Tyr 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 








275 










280 










285 










ATG 


GTT 


CAT 


TAC 


CCA 


TAT 


AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


912 


Met 


Val 


His 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 






290 










295 










300 












TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


960 


Phe 


Thr 


He 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 




305 










310 










315 










320 




TTP 


GAA 


gcc 


GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


1008 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 












325 










330 










335 










GAT 


aga 


TCA 


gag 


CTT 
\+ x x 


AGC 


CCG 


CTG 




CTG 


TCT 


ACA 


ACA 


GAG 


1056 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 


Leu 


Ser 


Thr 


Thr 


Glu 






340 










345 










350 








TGG 


CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC 


CTA 


TCC 


ACC 


1104 


Trp 


Gin 


lie 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala 


Leu 


Ser 


Thr 






355 








360 










365 










GGC 


CTG 


ATC 


CAC 


CTC 


CAT 


CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA 


TAC 


CTG 


TAC 


1152 


Gly 


Leu 


He 


His 


Leu 


His 


Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 


Tyr 


Leu 


Tyr 




370 










375 










380 
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GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200 
Gly Val Gly Ser Ala Val Val Ser Leu Val lie Lys Trp Glu Tyr Val 
385 390 395 400 

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248 
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg lie Cys Ala Cys Leu 
405 410 415 

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296 
Trp Met Met Leu Leu lie Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 1344 
Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392 

iFhe Leu Val Phe Phe Cys Ala Ala Trp Tyr lie Lys Gly Arg Leu Val 

:J| 450 455 460 

ffibT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 1440 

j?to Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 

m5 470 475 480 

IjrG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 147 6 

ifeu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

32} INFORMATION FOR SEQ ID NO: 36: 

- 

hk. (i) SEQUENCE CHARACTERISTICS: 

Q (A) LENGTH: 4 90 amino acids 

m (B) TYPE: amino acid 

ff (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Aso Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 
.1 "5 10 15 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 
20 25 30 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 
35 40 45 

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly 
50 55 60 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu 
65 70 75 80 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp 
100 105 110 
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Ser 


Leu 


Gin 
115 


Thr 


Gly 


Phe 


Phe 


Ala 
120 


Ala 


Leu 


Phe 


Tyr 


Lys 
125 


His 


Lys 


Phe 


Asn 


Ser 
130 


*Ser 


Gly 


Cys 


Pro 


Glu 
135 


Arg 


Leu 


Ala 


Ser 


Cys 
140 


Arg 


Ser 


He 


Asp 


Lys 
145 


Phe 


Ala 


Gin 


Gly 


Trp 
150 


Gly 


Pro 


Leu 


Thr 


Tyr 
155 


Thr 


Glu 


Pro 


Asn 


Ser 
160 


Ser 


Asp 


Gin 


Arg 


Pro 
165 


Tyr 


Cys 


Trp 


His 


Tyr 
170 


Ala 


Pro 


Arg 


Pro 


Cys 
175 


Gly 


lie 


Val 


Pro 


Ala 
180 


Ser 


Gin 


Val 


Cys 


Gly 
185 


Pro 


Val 


Tyr 


Cys 


Phe 
190 


Thr 


Pro 


Ser 


Pro 


Val 
195 


Val 


Val 


Gly 


Thr 


Thr 
200 


Asp 


Arg 


Phe 


Gly 


Val 
205 


Pro 


Thr 


Tyr 


A^n 


Tro 
210 


Gly 


Ala 


Asn 


Asp 


Ser 
215 


Asp 


Val 


Leu 


He 


Leu 
220 


Asn 


Asn 


Thr 


Arg 


m J 


Pro 


Arg 


Gly 


Asn 


Trp 
230 


Phe 


Gly 


Cys 


Thr 


Trp 
235 


Met 


Asn 


Gly 


Thr 


Gly 
240 


fet e 

s y 


Thr 


Lys 


Thr 


Cys 
245 


Gly 


Gly 


Pro 


Pro 


Cys 
250 


Asn 


He 


Gly 


Gly 


Ala 
255 


Gly 


&sn 


Asn 


Thr 


Leu 
260 


Thr 


Cys 


Pro 


Thr 


Asp 
265 


Cys 


Phe 


Arg 


Lys 


His 
270 


Pro 


Glu 




Thr 


Tyr 
275 


Ala 


Arg 


Cys 


Gly 


Ser 
280 


Gly 


Pro 


Trp 


Leu 


Thr 
285 


Pro 


Arg 


Cys 


lit 


Val 
290 


His 


Tyr 


Pro 


Tyr 


Arg 
295 


Leu 


Trp 


His 


Tyr 


Pro 
300 


Cys 


Thr 


Val 


Asn 


Phe 
305 


Thr 


He 


Phe 


Lys 


Val 
310 


Arg 


Met 


Tyr 


Val 


Gly 
315 


Gly 


Val 


Glu 


His 


Arg 
320 


Phe 


Glu 


Ala 


Ala 


Cys 
325 


Asn 


Trp 


Thr 


Arg 


Gly 
330 


Glu 


Arg 


Cys 


Asp 


Leu 
335 


Glu 


Asp 


Arg 


Asp 


Arg 
340 


Ser 


Glu 


Leu 


Ser 


Pro 
345 


Leu 


Leu 


Leu 


Ser 


Thr 
350 


Thr 


Glu 


Trp 


Gin 


He 
355 


Leu 


Pro 


Cys 


Ser 


Phe 
360 


Thr 


Thr 


Leu 


Pro 


Ala 
365 


Leu 


Ser 


Thr 


Gly 


Leu 


He 


His 


Leu 


His 


Gin 
375 


Asn 


He 


Val 


Asp 


Val 
380 


Gin 


Tyr 


Leu 


Tyr 


Gly 


Val 


Gly 


Ser 


Ala 


Val 
390 


Val 


Ser 


Leu 


Val 


He 
395 


Lys 


Trp 


Glu 


Tyr 


Val 
400 


Leu 


Leu 


Leu 


Phe 


Leu 
405 


Leu 


Leu 


Ala 


Asp 


Ala 
410 


Arg 


He 


Cys 


Ala 


Cys 
415 


Leu 


Trp 


Met 


Met 


Leu 
420 


Leu 


He 


Ala 


Gin 


Ala 
425 


Glu 


Ala 


Ala 


Leu 


Glu 
430 


Asn 


Leu 


Val 


Val 


Leu 


Asn 


Ala 


Ala 


Ala 


Val 


Ala 


Gly 


Ala 


His 


Gly 


Thr 


Leu 


Ser 



435 440 445 

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr lie Lys Gly Arg Leu Val 
450 455 460 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
465 470 475 480 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: cDNA 
Sf {iii) HYPOTHETICAL: NO 
5 (iii) ANTI-SENSE: NO 



W <ix) FEATURE : 



* (A) NAME /KEY: CDS 

y, (B) LOCATION:. 2. . 1018 

[? <ix) FEATURE: 

L~ {A) NAME/KEY: mat_peptide 

Q (B) LOCATION: 2. ,1015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 46 
He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 286 
Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin 
80 85 90 95 
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ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334 
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 
Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie Val Pro 
145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 
Ma Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
[||0 165 170 175 

ifG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574 
Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 

W 180 185 190 

IpG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 

Mia Asn Aso Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro Pro Arg 
^ ~ 195 200 205 

pSC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670 

biy Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys 

U 210 215 220 

ICG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718 

fflhr Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr 

p 225 230 235 

TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766 

Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 

240 245 250 255 

GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862 
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 
Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala 
290 295 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958 
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser 
320 325 330 335 

GGC AGA GCT TAATTA 1021 
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Gly Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



He 
1 


Pro Gin 


Ala 


Val 
5 


Val 


Asp 


Met 


Val 


Ala 
10 


Gly 


Ala His 


Trp Gly 
15 


Val 


llu 


Ala Gly 


Leu 
20 


Ala 


Tyr 


Tyr 


Ser 


Met 
25 


Val 


Gly 


Asn Trp 


Ala Lys 
30 


Val 




Val Val 
35 


Met 


Leu 


Leu 


Phe 


Ala 
40 


Gly 


Val 


Asp 


Gly His 
45 


Thr Arg 


Val 


Sfer 


Gly Gly 
50 


Ala 


Ala 


Ala 


Ser 
55 


Asp 


Thr 


Arg 


Gly 


Leu Val 
60 


Ser 


Leu 


Phe 


I H 

3er 

" ;; 65 


Pro Gly 


Ser 


Ala 


Gin 
70 


Lys 


He 


Gin 


Leu 


Val 
75 


Asn Thr 


Asn Gly 


Ser 
80 


pfep 


His He 


Asn 


Arg 
85 


Thr 


Ala 


Leu 


Asn 


Cys 
90 


Asn 


Asp Ser 


Leu 


Gin 
95 


Thr 


iy 


Phe Phe 


Ala 
100 


Ala 


Leu 


Phe 


Tyr 


Lys 
105 


His 


Lys 


Phe Asn 


Ser 
110 


Ser 


Gly 


Cys 


Pro Glu 
115 


Arg 


Leu 


Ala 


Ser 


Cys 
120 


Arg 


Ser 


He 


Asp Lys 
125 


Phe 


Ala 


Gin 


Gly 


Trp Gly 
130 


Pro 


Leu 


Thr 


Tyr 
135 


Thr 


Glu 


Pro 


Asn 


Ser Ser 
140 


Asp 


Gin 


Arg 


Pro 
145 


Tyr Cys 


Trp 


His 


Tyr 
150 


Ala 


Pro 


Arg 


Pro 


Cys 
155 


Gly He 


Val 


Pro 


Ala 
160 


Ser 


Gin Val 


Cys 


Gly 
165 


Pro 


Val 


Tyr 


Cys 


Phe 
170 


Thr 


Pro Ser 


Pro 


Val 
175 


Val 


Val 


Gly Thr 


Thr 
180 


Asp 


Arg 


Phe 


Gly 


Val 
185 


Pro 


Thr 


Tyr Asn 


Trp Gly 
190 


Ala 


Asn 


Asp Ser 
195 


Asp 


Val 


Leu 


He 


Leu 
200 


Asn 


Asn 


Thr 


Arg Pro 
205 


Pro Arg 


Gly 


Asn 


Trp Phe 
210 


Gly 


Cys 


Thr 


Trp 
215 


Met 


Asn 


Gly 


Thr 


Gly Phe 
220 


Thr Lys 


Thr 


Cys 
225 
Thr 


Gly Gly 
Cys Pro 


Pro 
Thr 


Pro 

Asp 
245 


Cys 
230 
Cys 


Asn 
Phe 


He 
Arg 


Gly 
Lys 


Gly 

His 
250 


Ala 
235 
Pro 


Gly Asn 
Glu Ala 


Asn Thr 

Thr Tyr 
255 


Leu 
240 
'Ala 
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Arg Cys Gly Ser GXy Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Gys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser Gly 
325 330 335 

Arg Ala 

#) INFORMATION FOR SEQ ID NO: 39: 

M (i) - SEQUENCE CHARACTERISTICS: 
H..1 {A) LENGTH: 1034 base pairs 

22 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



T (ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
H 5 (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

{A} NAME /KEY: CDS 

(B) LOCATION: 2.. 1032 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(3) LOCATION: 2.. 1029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GGA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 
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# 



TTT AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


238 


Phe Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin 


Leu 


Val 


Asn 


Thr 


Asn 


Gly 




65 










70 










75 










AGT TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC 


TGC 


AAC 


GAC 


TCC 


CTC 


CAA 


286 


Ser Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys 


Asn 


Asp 


Ser 


Leu 


Gin 




80 








85 










90 










95 




ACA GGG 

^\W±V WWW 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA 


CAC 


AAA 


TTC 


AAC 


TCG 


TCT 


334 


Thr Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lys 


His 


Lys 


Phe 


Asn 


Ser 


Ser 








100 










105 










110 






GGA TGC 
uun J- ww 


CCA 

W W-ti 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC 


TCC 


ATC 


GAC 


AAG 


TTC 


GCT 


382 


Gly Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 


Lys 


Phe 


Ala 








115 










120 










125 








CAG GGG 

wAw www 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


TCG 


GAC 


CAG 


430 


Gin Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 


Ser 


Asp 


Gin 






130 










135 










140 










jSSG CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


CCC 


478 


lifg Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 


He 


Val 


Pro 




Tl 145 










150 










155 












fitG tct 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


526 


la Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 


Ser 


Pro 


Val 












165 










170 










175 




6tg gtg 

VJ i W W -b W 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


574 


til Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 


Asn 


Trp 


Gly 










180 










185 










190 






p^G AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


622 


Ala Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arg 








195 










200 










205 








GGC AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


670 


Gly Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Gly 


Thr 


Gly 


Phe 


Thr 


Lys 






210 










215 










220 










ACG TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


718 


Thr Gys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 


Asn 


Asn 


Thr 




225 










230 










235 












TTG ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


GCC 


ACC 


TAC 


766 


Leu Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 


Ala 


Thr 


Tyr 




240 








245 










250 










255 




GCC AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


ATG 


GTT 


CAT 


814 


Ala Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 


Met 


Val 


His 










260 










265 










270 






TAC CCA 

X WW.T1. 


TAT 


AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


TTC 


ACC 


ATC 


862 


Tyr Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 


Phe 


Thr 


He 






275 










280 










285 








TTC AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


910 


Phe Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 






290 










295 










300 










GCA TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


GAC 


AGG 


GAT 


958 


Ala Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 


Asp 


Arg 


Asp 
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305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly 
320 325 330 335 

CAG ACA CCA TCA CCA CCA TCA CTA AT AG 1034 
Gin Thr Pro Ser Pro Pro Ser Leu 
340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

k f (ii) MOLECULE TYPE: protein 

yB (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

=lle Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
Sll 5 10 15 

ifl|u Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 

tli 20 25 30 

Lsu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 

k| 35 40 45 

Sir Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
C 50 55 60 

Pi 

E£r Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr Asn Gly Ser 
5 65 70 75 80 

Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 



Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

G$s Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 

m5 310 315 320 

. 

^r Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly Gin 
hj 325 330 335 

Sir Pro Ser Pro Pro Ser Leu 

m 340 

£2) INFORMATION FOR SEQ ID NO: 41: 

?T {i} SEQUENCE CHARACTERISTICS: 
Q (A) LENGTH: 945 base pairs 

U (B) TYPE: nucleic acid 

O <C) STRANDEDNESS : single 

f*& (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

{iii} ANTI-SENSE: NO 

(ix) FEATURE: 

{A} NAME /KEY: CDS 
(B) LOCATION: 1..942 

(ix) FEATURE: 

(A) NAME /KEY : matjpeptide 

(B) LOCATION: 1..939 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 



HCVENV.-T 



ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 40 45 



144 



GAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 
Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
50 55 60 



192 



AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 
Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 



240 



AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 



288 



CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 
Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
O 100 105 110 



336 



■g&G CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 
Slu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 

s.l 115 120 125 



384 



BSA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 
Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 



432 



S£C TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 
feys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
535 150 155 160 



480 



SitC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 
ff^i Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
U 165 170 175 



528 



AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 



576 



AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



624 



GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 



672 



AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 



720 



ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 



768 



TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 
Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 



816 



GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 



864 



HCVENV.-T 



Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA AGA GAG TGG CAG AGC TTA ATT AAT TAG 945 
Ser Thr Thr Glu Trp Gin Ser Leu lie Asn 
305 310 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
P (D) TOPOLOGY: linear 

yn (ii) MOLECULE TYPE: protein 

Js (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

|it Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
jWl 5 10 15 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 

U- 20 25 30 

fir Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
35 40 45 

®n Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
H; 50 55 60 

Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



1 ftzxnf T 



# 



Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr lie Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
O 290 295 300 

Jer Thr Thr Glu Trp Gin Ser Leu He Asn 

Ijk) INFORMATION FOR SEQ ID NO: 43: 

fii (i) SEQUENCE CHARACTERISTICS: 
111 (A) LENGTH: 961 base pairs 

^ (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



fad 



H ; (ii) MOLECULE TYPE: cDNA 
PI (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..958 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..955 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 



Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu 
50 55 60 



AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 
Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 



240 



AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 



288 



CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 
Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 



336 



GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 
Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 " 120 125 



384 



IpA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 
ftrg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
H 130 135 140 



432 



ftGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 

SEys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 

;|45 150 155 160 

?GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 

U£al Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 

m 165 170 175 



l&AC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 
<gsn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 

IJ 180 185 190 



576 



AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



624 



GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 



672 



AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 



720 



ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 



768 



TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 
Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 



816 



GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 8 64 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 



1 1/^* /Tin t T 



• 



290 295 300 

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958 
Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

TAG 961 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

S3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

At Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
Ml 5 10 15 

Hty Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
W 20 25 30 

ilr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
g 35 40 45 

Sin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 

Sin Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
35 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 

210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 



225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg- Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

Q2) INFORMATION FOR SEQ ID NO: 45: 

;a (i) SEQUENCE CHARACTERISTICS: 
rt (A) LENGTH: 1395 base pairs 

„1 (B) TYPE: nucleic acid 

W (C) STRANDEDNESS : single 

Q (D) TOPOLOGY: linear 

¥% \ 

ip <ii) MOLECULE TYPE: cDNA 

? a (iii) HYPOTHETICAL: NO 

9 (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG 


GTG GCG GGG GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC 


CTC GCC TAC TAT 


48 


Met 


Val Ala Gly Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu Ala Tyr Tyr 




1 


5 










10 






15 




TCC 


ATG GTG GGG AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG 


ATG CTA CTC TTT 


96 


Ser Met Val Gly Asn 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val 


Met Leu Leu Phe 






20 






25 








30 




GCC 


GGC GTC GAC GGG 


CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG 


GCA GCA GCC TCC 


144 


Ala 


Gly Val Asp Gly 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly 


Ala Ala Ala Ser 






35 






40 










45 




GAT 


ACC AGG GGC CTT 


GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG 


TCG GCT CAG AAA 


192 


Asp 


Thr Arg Gly Leu 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly 


Ser Ala Gin Lys 






50 




55 










60 






ATC 


CAG CTC GTA AAC 


ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC 


AAC AGG ACT GCC 


240 


He 


Gin Leu Val Asn 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He 


Asn Arg Thr Ala 




65 




70 










75 




80 





CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC 288 
Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC 336 
Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC 384 
Cys Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG 432 
Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

BbT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG 480 
ij&ro Arg Pro Cys Gly lie Val Pro Ala Ser Gin Val Cys Gly Pro Val 
gp5 150 155 160 

,fkr TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT 528 
y^r Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
g 165 170 175 

ifeT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT 576 
dGly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu lie 
y, 180 185 190 

WrC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG 624 
"ifeu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
O 195 200 205 

2rG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC 672 
Set Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
210 215 220 

ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT 720 
lie Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
225 230 235 240 

CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG 768 
Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 

CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC 816 
Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 

260 265 270 

CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG 864 
Pro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly 
275 280 285 

GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 912 
Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 960 
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 1008 



Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 1056 
Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp 
340 345 350 

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 1104 
Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 1152 
Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 1200 
He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
SS5 390 395 400 

Sfc TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 1248 
Ma Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
W 405 410 415 

||T GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC 1296 

Ifls Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Tro Tyr lie 

Ul 420 425 430 

kkG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG 1344 

f|s Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 

hk 435 440 445 

$£G CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 1395 

Mo Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 

M< 450 455 460 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys 
50 55 60 

He Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
65 70 75 80 



Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cys Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

Pro Arg Pro Cys Gly lie Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
165 170 175 



Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu lie 
180 185 190 



Lieu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Tro 
195 200 205 



mt Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 

y| 210- 215 220 

lie Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 

b§5 230 235 240 

f^rg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
h 245 250 255 



Mu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
^ 260 265 270 

Pro Cys Thr Val Asn Phe Thr lie Phe Lys Val Arg Met Tyr Val Gly 
275 280 285 

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 

Leu Ser Thr Thr Glu Trp Gin lie Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu lie His Leu His Gin Asn He Val Asp 
340 345 350 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

He Cys Ala Cys Leu Trp Met Met Leu Leu lie Ala Gin Ala Glu Ala 
385 390 395 400 



Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 



ur\/CM\/ T 



405 



410 



415 



His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He 
420 425 430 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 47: 

{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 
^ (C) STRANDEDNESS : single 
'|f (D) TOPOLOGY: linear 

*J5j (ii) MOLECULE TYPE: cDNA 

kl (iii) HYPOTHETICAL: NO 

55 (iii) ANTI-SENSE: NO 

HI 

s (ix) FEATURE: 

yk (A) NAME /KEY: CDS 

Pf (B) LOCATION: 1..2079 

yl (ix) FEATURE: 

(A) NAME/ECEY: matjpeptide 

W (B) LOCATION: 1..2076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 
Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

ICC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



• 



TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 _ 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Sly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
J} 165 170 175 

IpG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser lie Tyr 

Jf 180 185 190 

ItibC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

Iffro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 

~ ' 195 200 205 

J&G TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 

"Sbp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 

H : 210 215 220 

@AA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 

gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 

525 230 235 240 

GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 768 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 . 250 255 

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 816 
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
260 265 270 

GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 864 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
275 280 285 

GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 912 
Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 

ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC 960 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
305 310 315 320 

TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 1008 
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 1056 



Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala Gin Gly Trp 
340 345 350 

GGT CCC CTC ACT TAG ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC 1104 
Gly Pro Leu Thr Tyr Thr Glu Pro Asa Ser Ser Asp Gin Arg Pro Tyr 
355 360 365 

TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG 1152 
Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie Val Pro Ala Ser Gin 
370 375 380 

GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG 1200 
Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC 1248 

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 

Pt 405 410 415 

SfcG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG 1296 

Sir Asp Val Leu lie Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 

ft 420 425 430 

SBjtC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG 1344 
ftie Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 
m 435 440 445 

&lc CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC 1392 
Sly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC 1440 
fero Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
pEls 470 475 480 

fefeT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT 1488 
Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro, Tyr 
485 490 495 

AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT 1536 
Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe Lys Val 
500 505 510 

AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT 1584 
Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG 1632 
Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT 1680 
Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin lie Leu Pro Cys 
545 550 555 560 

TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT 1728 
Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 
565 570 575 

CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT 1776 
Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
580 585 590 



GTC TCC 


CTT 


GTC 


ATC 


AAA 


TGG GAG 


TAT 
xrlX 


GTC 


CTG TTG CTC TTC 


CTT 




10 Z 4 


vai ijcl 




v ax 


Tie 
X xc 




XX£J VJXU. 


iyr 


Vai 


Leu Leu Leu Phe 


Leu 


Leu 














wwv 




605 








PTG GCA 


GAC 


GCG 


CGC 


ATC 


TGC GCC 




TTA 


TGG ATG ATG CTG 


CTG 


ATA 


1 A79 


Lea Ala 


Asp 


Ala 


Arg 


He 


Cys Ala 


Cys 


Leu Trp Met Met Leu Leu 


He 




610 










615 






620 








GCT CAA 


GCT 


GAG 


GCC 


GCC 


TTA GAG 


AAC 


CTG 


GTG GTC CTC AAT 


GCG 


GCG 


1 Q^n 


Ala Gin 


Ala 


Glu 


Ala 


Ala 


Leu Glu 


Asn 


Leu 


Vai Vai Leu Asn 


Ala 


Ala 




625 








630 








635 




640 




GCC GTG 


GCC 


GGG 


GCG 


CAT 


GGC ACT 


CTT 


TCC 


TTC CTT GTG TTC 


TTC 


TGT 


1 jDO 


Ala Vai 


Ala 


Gly 


Ala 


His 


Gly Thr 


Leu 


Ser 


Phe Leu Vai Phe 


Phe Cys 










645 








650 




655 






GCT GCC 

Ww * Www 


TGG 


TAC 


ATC 


AAG 


GGC AGG 


CTG 


GTC 


CCT GGT GCG GCA 


TAC 


GCC 




?tla Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg 


Leu 


Vai 


Pro Gly Ala Ala Tyr Ala 








660 








665 




670 








f¥c TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC 


CTG 


CTT 


CTG CTG GCC TTA 


CCA 


CCA 


2064 


Me Tyr 


Gly 


Vai 


Trp 


Pro 


Leu Leu 


Leu 


Leu 


Leu Leu Ala Leu 


Pro 


Pro 




-SB*! 


675 








630 






685 








i§A GCT 


TAT 


GCC 


TAGTAA 














2082 


iBg Ala 


Tyr 


Ala 





















690 



□ ) INFORMATION FOR SEQ ID NO: 48: 

m (i) SEQUENCE CHARACTERISTICS: 
S (A) LENGTH: 692 amino acids 

1^ (B) TYPE: amino acid 

- {D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 48: 

Asn Leu Gly Lys Vai He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
r l 5 10 15 

Vai Gly Tyr He Pro Leu Vai Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Vai Arg Vai Leu Glu Asp Gly Vai Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Vai Pro Ala Ser Ala Tyr Glu Vai Arg Asn Vai 
65 70 75 80 

Ser Gly Met Tyr His Vai Thr Asn Asp Cys Ser Asn Ser Ser He Vai 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Vai Pro Cys 
100 105 110 



♦ 



Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asd Cys Asn Cys Ser lie Tyr 
180 185 * 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 



Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 



Mln Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
Jf25 230 235 240 

fUly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 

||| 245 250 255 

= Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 

{EI 260 265 270 

Hlly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
O 275 280 285 

o 

sJ&ly Ser Ala Gin Lys lie Gin Leu Val Asn Thr Asn Gly Ser Trp His 

r ' : " 290 295 300 

He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
305 310 315 320 

Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 

Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp 
340 345 350 

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 
355 360 365 

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin 
370 375 380 

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

Thr Thr Asp Arg Phe' Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 
405 410 415 

Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
420 425 430 



Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 



435 440 445 

Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
465 470 475 480 

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe Lys Val 

500 505 . 510 

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

3£cp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
^ 530 535 540 

g|u Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys 
SJ5 550 555 560 

I|r Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 

5f 565 570 575 

fin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
a 580 585 590 

fe^l Ser Leu Val He Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu 

S 595 600 605 

filu Ala Asp Ala Arg He Cys Ala Cys Leu Trp Met Met Leu Leu He 

O 610 615 620 

Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 

625 630 635 640 

Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys 
645 650 655 

Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 635 

Arg Ala Tyr Ala 
690 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(BJ LOCATION: 1.. 24 30 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ATG AGC ACG 
Met Ser Thr 


AAT 
Asn 


CCT 
Pro 
5 


AAA 
Lys 


CCT 
Pro 


CAA 
Gin 


AGA AAA 
Arg Lys 
10 


ACC AAA CGT AAC 
Thr Lys Arg Asn 


ACC 
Thr 
15 


cgc CGC CCA 
ft|g Arg Pro 


CAG 
Gin 
20 


GAC 
Asp 


GTC 
Val 


AAG 
Lys 


TTC 
Phe 


CCG GGC 
Pro Gly 
25 


GGT GGT CAG ATC 
Gly Gly Gin He 
30 


GTT 
Val 


©A GTT TAC 
fly Val Tyr 

m 35 


CTG 
Leu 


TTG 
Leu 


CCG 
Pro 


CGC 
Arg 


AGG 
Arg 
40 


GGC CCC 
Gly Pro 


AGG TTG GGT GTG 
Arg Leu Gly Val 
45 


CGC 
Arg 


|£T AGG AAG 
.ftr Arg Lys 
CI 50 


ACT 
Thr 


TCC 
Ser 


GAG 
Glu 


CGG 
Arg 
55 


TCG 
Ser 


CAA 
Gin 


CCT 
Pro 


CGT GGG AGG CGA 
Arg Gly Arg Arg 
60 


CAA 
Gin 


iQC CCC AAG 
fie Pro Lys 

S 5 


GCT 
Ala 


CGC 
Arg 


CGA 
Arg 
70 


CCC 
Pro 


GAG 
Glu 


GGT AGG 
Gly Arg 


GCC 
Ala 
75 


TGG GCT CAG 
Trp Ala Gin 


CCC 
Pro 


tac rpT Tf^rs 
Tyr Pro Trp 


CCC 

Pro 


CTP 

Leu 
85 


TAT 

Tyr 


Gly 


Asn 


GAG GGC 
Glu Gly 
90 


ATG 
Met 


GGG TGG GCA 
Gly Trp Ala 


Gly 
95 


CTC CTG TCA 
Leu Leu Ser 


CCC 
Pro 
100 


CGC 
Arg 


GGC 
Gly 


TCT 
Ser 


CGG 
Arg 


CCT 
Pro 
105 


AGT 
Ser 


TGG 
Trp 


GGC CCT ACA 
Gly Pro Thr 
110 


GAC 
Asp 


CGG CGT AGG 
Arg Arg Arg 
115 


TCG 
Ser 


CGT 
Arg 


AAT 
Asn 


TTG 
Leu 


GGT 
Gly 
120 


AAG 
Lys 


GTC 
Val 


ATC GAT ACC CTT 
He Asp Thr Leu 
125 


ACA 
Thr 


GGC TTC GCC 
Gly Phe Ala 
130 


GAC 
Asp 


CTC 
Leu 


GTG 
Val 


GGG 
Gly 
135 


TAC 
Tyr 


ATT 
He 


CCG 
Pro 


CTC 
Leu 


GTC GGC GCC 
Val Gly Ala 
140 


CCC 
Pro 


GGG GGC GCT 
Gly Gly Ala 
145 


GCC 
Ala 


AGG 
Arg 


GCC 
Ala 
150 


CTG 
Leu 


GCG 
Ala 


CAT 
His 


GGC 
Gly 


GTC CGG GTT CTG 
Val Arg Val Leu 
155 


GAG 
Glu 


GGC GTG AAC 
Gly Val Asn 


TAT 
Tyr 


GCA 
Ala 
165 


ACA 
Thr 


GGG 
Gly 


AAT 
Asn 


TTG 
Leu 


CCC 
Pro 
170 


GGT TGC TCT TTC 
Gly Cys Ser Phe 


TCT 
Ser 
175 



GGT 



80 



160 



TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 



180 185 190 

GAA GTG CGC AAC GTG TCC GGG ATG TAG CAT GTC ACG AAC GAC TGC TCC 624 
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 672 
Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met lie Met His Thr Pro 
210 215 220 

GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 720 
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 768 
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 - 

|§A ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 816 
tSr' lie Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

WC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 864 
fir Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
m 275 280 285 

if! 

©G CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC 912 
<Sln Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
290 295 300 

iBSKS. 

MT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG 960 
mn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
Jj5 310 315 320 

(IXr ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG 1008 
Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT 1056 
Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG 1104 
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT 1152 
Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
'370 375 380 

ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG 1200 
Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 

TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC 1248 
Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC 1296 
Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 



HPVPNV -T 



CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAG AAA CAC AAA TTC AAC 1344 
Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 1392 
Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys 
450 455 460 

TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 1440 
Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 1488 
Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie 
485 490 495 

g|TA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 1536 

TSal Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

J 500 505 510 

/ifcT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 1584 

y#ro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
CI 515 520 525 

iJfGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 1632 

^frp Gly Ala Asn Asp Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro 
530 535 540 

BtCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 1680 
|*Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
ff 45 550 555 560 

HCC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 1728 
H?hr Lys Thr Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn 
565 570 575 

AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 1776 
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
580 585 590 

ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 1824 
Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 

GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 1872 
Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 1920 
Thr lie Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 1968 
Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 2016 
Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 



# 



TAft ATA 




ccc 




TCC 




acc 


ACC 


CTG 


CCG GCC CTA TCC 


ACC 


GGC 




Gin He 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro Ala Leu Ser Thr Gly 






675 










680 






685 










rap 




f*AT 




A AP 


T\*T*C 


GTG 


GAC 


GTG CAA TAC CTG 


TAC 


GGT 


Till 

2112 


Leu He 


His 


Leu 


His 


Gin 


Asn 


He 


Val Asp 


Val Gin Tyr Leu Tyr Gly 




690 










695 








700 












ccc 


Oil 




1U^/ 


Wl L 


GTC 


ATC 


AAA TGG GAG TAT 


GTC 


CTG 


£lOU 


Val Gly 


Ser 


Ala 


Val 


Val 


Ser 


Leu 


Val 


He 


Lys Trp Glu Tyr Val Leu 




705 








710 










715 




720 




T ,r T 1 ^ C*PC 










^PA 


car* 


GCG 


CGC 


ATC TGC GCC TGC 


TTA 


TGG 


*"S 0 A 0 

2208 


Leu Leu 


Phe 


Leu 


Leu 


Leu 


Ala 


Asp 


Ala Arg 


He Cys Ala Cys 


Leu Trp 










725 










730 




735 






Alb H.I o 






AiA 








GAG 


GCC 


GCC TTA GAG AAC 


CTG 


GTG 


2256 


Met Met 


Leu 


Leu 


He 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala Leu Glu Asn 


Leu 


Val 








740 










745 




750 










AAT 


err, 


GCG 


GCC 






GGG 


GCG 


CAT GGC ACT CTT 


TCC 


TTC 


0 *3 n a 


Leu 


Asn 


Ala 


Ala 


Ala 


Val 


Ala 


Gly Ala 


His Gly Thr Leu 


Ser 


Phe 






755 










760 






765 








c^r GTG 


TTC 


TTC 


TGT 


GCT 


GCC 


TGG 


TAC 


ATC 


AAG GGC AGG CTG 


GTC 


CCT 


2352 


Val 


Phe 


Phe 


Cys Ala 


Ala 


Trp 


Tyr 


He 


Lys Gly Arg Leu Val 


Pro 




^ 770 










775 








780 








d#r gcg 


GCA 


TAG 


GCC 


TTC 


TAT 


GGC 


GTG 


TGG 


CCG CTG CTC CTG 


CTT 


CTG 


2400 


<Sk Ala 


Ala 


Tyr 


Ala 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro Leu Leu Leu 


Leu 


Leu 












790 










795 




800 




Q^p GCC 


TTA 


CCA 


CCA 


CGA 


GCT 


TAT 


GCC 


TAGTAA 






2433 


Ala 


Leu 


Pro 


Pro Arg 


Ala 


Tyr 


Ala 


















805 










810 











(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 809 amino acids 
{B} TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin lie Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 



lie Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 



• 



Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val lie Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie 
q 165 170 175 

u 5lhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 

ft 180 185 190 

LMSlu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
O 195 200 205 

ni 

^&sn Ser Ser lie Val Tyr Glu Ala Ala Asp Met lie Met His Thr Pro 
;^ 210 215 220 

FGly Cys Val Pr'o Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
CI25 230 235 240 

Nfcla Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 

5? 245 250 255 

ij 

Hthr lie Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
275 280 285 

Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
* 290 295 300 

Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 



Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys 
450 455 460 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 430 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 
485 490 495 

*fal Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 

^iro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 

|ij 515 520 525 

3!rp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro 
\% 530 535 540 

sPro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
|445 550 555 560 

3Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn 
565 570 575 

®sn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
H 580 585 590 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 

595' 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 * 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 

Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 715 720 



Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 



Met 


Met 


Leu 


Leu 


lie 


Ala 


Gin 








740 








Val 


Leu 


Asn 


Ala 


Ala 


Ala 


Val 






755 










Leu 


Val 


Phe 


Phe 


Cys 


Ala 


Ala 




770 








775 


Gly Ala 


Ala 


Tyr 


Ala 


Phe 


Tyr 


785 










790 




Leu 


Ala 


Leu 


Pro 


Pro 


Arg 


Ala 










805 






INFORMATION 


FOR 


SEQ 


ID * 



725 730 735 

Ala Glu Ala Ala Leu Glu Asn Leu Val 
745 750 

Ala Gly Ala His Gly Thr Leu Ser Phe 
760 765 

Trp Tyr lie Lys Gly Arg Leu Val Pro 
780 

Gly Val Trp Pro Leu Leu Leu Leu Leu 

795 800 



W (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 
y (B) TYPE: amino acid 

rt (C) STRANDEDNESS : single 

Sj (D) TOPOLOGY: linear 

y§ (ii) MOLECULE TYPE: peptide 



Q (ix) FEATURE: 

y- (A) NAME /KEY: Modif ied-site 



(B) LOCATION: 1..17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ser Asn Ser Ser Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys 
15 10 15 

Val 



{2} INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..22 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 



HPVPNV-T 



10 15 



Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..37 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



Tyr Glu Val Arg Asn Val Ser Gly lie Tyr His Val Thr Asn Asp Cys 
15 10 15 

Ser Asn Ser Ser lie Val Tyr Glu Ala Ala Asp Met lie Met His Thr 
20 25 30 

P Pro Gly Cys Gly Lys 

M= . 35 

^2) INFORMATION FOR SEQ ID NO: 54: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{ix) FEATURE: 

{A) NAME /KEY: Modif ied-site 
(B) LOCATION: 1..25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg His lie Asp Leu Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 



HPVFNV-T 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
15 10 15 

Thr lie Arg Arg His Val Asp Leu Leu 
20 25 



^2). INFORMATION FOR SEQ ID NO: 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



m 



(ii} MOLECULE TYPE: peptide 



Sf (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

p 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn 
1 5 10 15 

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
15 10 15 

Asn Ser Ser lie 
20 

(2) INFORMATION FOR SEQ ID NO: 58: 



HCVFNV-T 



• 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser lie Val Tyr Glu Ala His Asp Ala lie 
15 10 15 

Leu His Thr Pro 
20 

Ii) INFORMATION FOR SEQ ID NO: 59: 

yB (i) SEQUENCE CHARACTERISTICS: 
%| (A) LENGTH: 20 amino acids 

ul (B) TYPE: amino acid 

2; (C) STRANDEDNESS: single 

■jf (D) TOPOLOGY: linear 

HI (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

O Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 
O 1 5 10 15 

Pro Gly Cys Val 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 
{B} TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala He Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly 
15 10 15 

Asn Val Ser 

(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 



HCVPNV.-T 



(A) LENGTH: 20 amino acids 

(S) TYPE: amino acid 

<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
15 10 15 

Thr Val Ala Thr 
20 

(A) INFORMATION FOR SEQ ID NO: 62: 

© (i) SEQUENCE CHARACTERISTICS: 
W (A) LENGTH: 20 amino acids 

Sf (B) TYPE: amino acid 

hi (C) STRANDEDNESS : single 

fSJ (D) TOPOLOGY: linear 

j| {ii} MOLECULE TYPE: peptide 



N| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

EI Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
^ 15 10 15 

Gin Leu Arg Arg 
20 

{2} INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gin Leu Arg Arg His lie Asp Leu Leu Val Gly Ser 
15 10 15 

Ala Thr Leu Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 amino acids 

(B) TYPE: amino acid 



HCVHNV.-T 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
15 10 15 

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 20 amino acids 

% (B) TYPE: amino acid 

(C) STRANDEDNESS: single 
JW (D) TOPOLOGY: linear 

y| (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Gly Cys 
1 5 10 15 

Asn Cys Ser lie 
20 

!) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gin Gly Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His 
15 10 15 

Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



HCVENV.-T 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

lie Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
15 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 20 amino acids 
O (B) TYPE: amino acid 

,f! (C) STRANDEDNESS: single 

|jj (D) TOPOLOGY: linear 

% l (ii) MOLECULE TYPE: peptide 



m (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

:\ Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gin Leu Leu Arg lie 
E 1 5 10 15 

M< Pro Gin Ala He 

i 20 

<S) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg He Pro Gin Ala He Leu Asp Met He Ala Gly Ala His 
15 10 15 

Trp Gly Val Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



HCVENV.-T 



(ii) MOLECULE TYPE: peptide 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
15 10 15 

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
? m (B) TYPE: amino acid 

<C) STRANDEDNESS : single 
V (D) TOPOLOGY: linear 

M (ii) MOLECULE TYPE: peptide 



5| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

s Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val Ser 
H* 1 5 10 15 

Gly Gly Gin Ala 



fl) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gin Asn He Gin 
15 10 15 

Leu He Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



HCVENV.-T 



• 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gin Asn He Gin Leu He Asn Thr Asn Gly Gin Trp His He Asn Ser 
15 10 15 

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
0 (B) TYPE: amino acid 

yg (C) STRANDEDNESS : single 

±n (D) TOPOLOGY: linear 

\1i (ii) MOLECULE TYPE: peptide 



Fli (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Trp Leu Ala Gly Leu 

y, 15 10 15 

W He Tyr Gin His Lys 

H : 20 

m INFORMATION FOR SEQ ID NO: 75: 

r * (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

, (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu 
15 10 15 

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(S) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 76; 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp 
15 10 15 

Gin Gly Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 77; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS: single 
m (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



o 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

m 

Thr Asp Phe Asp Gin Gly Trp Gly Pro lie Ser Tyr Ala Asn Gly Ser 
f 1 5 10 15 

H 8 Gly Pro Asp Gin 

0 20 

INFORMATION FOR SEQ ID NO: 78: 

H (i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro 
15 10 15 

Pro Lys Pro Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

£A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp His Tyr Pro Pro Lys Pro Cys Gly lie Val Pro Ala Lys Ser Val 
1*5 10 15 

Cys Gly Pro Val 
20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Jl (ii) MOLECULE TYPE: peptide 



jyyj (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

n I Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 

ill 1 5 10 15 

* Val Val Gly Thr 

H 1 20 

(jS) INFORMATION FOR SEQ ID NO: 81: 

m. 

3T (i) SEQUENCE CHARACTERISTICS: 
^ ^ (A) LENGTH: 20 amino acids 
m * (B} TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii). MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
15 10 15 

Tyr Ser Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION; SEQ ID NO: 82; 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
15 10 15 

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

yjf Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys 
Q 1 5 10 * 15 

^ Val Cys Gly Ala 

m 20 

it, 

f#) INFORMATION FOR SEQ ID NO: 84: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys He Gly Gly Ala 
15 10 15 

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



lie Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg 
15 10 15 

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

*Ts Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly 
s^l 5 10 15 

Q Ser Gly Pro Trp 

fU 20 

y2) INFORMATION FOR SEQ ID NO: 87: 

5? (i) SEQUENCE CHARACTERISTICS: 
W (A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
CI (C) STRANDEDNESS: single 

Q (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp lie Thr Pro Arg Cys Leu Val Asp 
15 10 15 

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr lie 



10 15 



Asn Tyr Thr lie 
20 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

m Pro Cys Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Vai Gly 
yy 1 5 10 15 

s .1 Gly Vai Glu His 
20 

111) INFORMATION FOR SEQ ID NO: 90: 

r (i) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 20 amino acids 
lm (B) TYPE: amino acid 

H (C) STRANDEDNESS: single 

£ (D) TOPOLOGY: linear 

p (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Vai Gly Gly Vai Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
15 10 15 

Thr Pro Gly Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
15 10 15 
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Arg Ser Glu Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

^ Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr 
N 1 5 10 15 

£5 Gin Trp Gin Val 

if 20 

tl) INFORMATION FOR SEQ ID NO: 93: 

H (i) SEQUENCE CHARACTERISTICS : 
O (A) LENGTH: 9 amino acids 

y>. (B) TYPE: amino acid 

h <C) STRANDEDNESS: single 

5| (D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
Tyr Gin Val Arg Asn Ser Thr Gly Leu 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 



29 



(2} INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



r| <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

j^TCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60 



p) INFORMATION FOR SEQ ID NO: 96: 

lm (i) SEQUENCE CHARACTERISTICS: 
y - (A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
H 5 (C) STRANDEDNESS: single 

PI (D) TOPOLOGY: linear 

C (ii) MOLECULE TYPE: DNA (genomic) 

S (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 19 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
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GTCCCATCGA GTGCGGCTAC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



CI (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

SgTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA 

||) INFORMATION FOR SEQ ID NO: 99: 

flj (i) SEQUENCE CHARACTERISTICS: 

HI (A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

T, (C) STRANDEDNESS: single 

U (D) TOPOLOGY: linear 
U 

N" (ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



fACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 
(2) INFORMATION FOR SEQ ID NO: 101: 

• (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



O (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

'sissy- 

TlTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 
C8J INFORMATION FOR SEQ ID NO: 102: 

Sf {i} SEQUENCE CHARACTERISTICS: 
5f! (A) LENGTH: 48 base pairs 

V4 (B) TYPE: nucleic acid 

U1 (C) STRANDEDNESS: single 

p (D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

W (iii) HYPOTHETICAL: NO 

p (iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



CCGGAATGTA CCATGTCACG AACGAC 26 
ft) INFORMATION FOR SEQ ID NO: 105: 

%c 

m (i) SEQUENCE CHARACTERISTICS: 
\i (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
S! (C) STRANDEDNESS: single 

Jjj (D) TOPOLOGY: linear 

ill 

fJ! (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

H (iii) ANTI-SENSE: NO 



^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GCTCCATTGT GTATGAGGCA GCGG 24 



(2) INFORMATION FOR SEQ ID NO: 106: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGCTCCCGC TGCTGGGTAG CGC 23 
(2) INFORMATION FOR SEQ ID NO: 107: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CCTCCGTCCC CACCACGACA ATACG 25 
(2) INFORMATION FOR SEQ ID NO: 108: 

3 

,f! (i) SEQUENCE CHARACTERISTICS: 
,i (A) LENGTH: 27 base pairs 

ni' (B) TYPE: nucleic acid 

,1 (C) STRANDEDNESS: single 

W (D) TOPOLOGY: linear 

rU (ii) MOLECULE TYPE: DNA (genomic) 

in 

J* (iii) HYPOTHETICAL: NO 
•:IT (iii) ANTI-SENSE: NO 



Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

5TACCCGGGC CACATAACGG GTCACCG 27 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GGAGGCCTAC AACGGCCCTG GTGG 24 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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■(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
{iii) ANT I -SENSE: NO 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TTCTATCGAT TAAATAGAAT TC 22 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 
O (A) LENGTH: 23 base pairs 

>f% (B) TYPE: nucleic acid 

In (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

W (ii) MOLECULE TYPE: DNA (genomic) 

ftf (iii) HYPOTHETICAL: NO 

]f (iii) ANTI-SENSE: NO 

U. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

(pfCATACGCT CACAGCCGAT CCC 23 
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